Abstract

Quantum theory is formulated as the only consistent way to manipulate
probability amplitudes. The crucial ingredient is a consistency constraint:
if there are two different ways to compute an amplitude the two answers must
agree. This constraint is expressed in the form of functional equations the
solution of which leads to the usual sum and product rules for amplitudes. A
consequence is that the Schrödinger equation must be linear: non-linear
variants of quantum mechanics are inconsistent. The physical interpretation
of the theory is given in terms of a single natural rule. This rule, which
does not itself involve probabilities, is used to obtain a proof of Born's
statistical postulate. Thus, consistency leads to indeterminism.

PACS: 03.65.Bz, 03.65.Ca.
1 Introduction

In 1946 R. T. Cox gave an argument showing that once degrees of probability
are represented by real numbers there is a unique set of rules for inductive
reasoning, that is, for reasoning under conditions of insufficient information.
The crux of the argument is a consistency requirement: if a probability can be
computed in two different ways, the two answers must agree. Cox expressed
this consistency requirement in the form of functional equations, the solution of
which showed that the rules for inductive reasoning coincide with the well-known
rules of probability theory. The importance of this achievement is twofold: First,
it legitimized viewing probability theory as an extended form of logic, a point
of view that goes back to Bernoulli and Laplace, was arguably held by Gibbs,
and which, more recently, has been forcefully advocated by Jaynes. Second,
Cox's argument provides an explanation for the uniqueness of probability theory,
for its inevitability; any modifications of the rules of probability theory will
necessarily lead to inconsistencies, and therefore be unsatisfactory.

This latter feature, the robustness of probability theory, is also shared by
quantum theory. The quest to explain the strange behavior of quantum systems
has, since the beginning, led to all sorts of attempts to modify the theory. Two
of the most central attributes of quantum theory, indeterminism and linearity,
have been the target of many such unsuccessful attempts. There has been
considerable progress on issues related to the possibility of hidden variables and
to the nature of statistical correlations. Linearity violations have been
considered as a means to resolve the difficulties associated with the quantum
mechanics of macroscopic objects. Some authors have been motivated by the fact
that many linear physical theories are mere approximations to more fundamental
non-linear theories, while others were led either by the desire to test quantum
mechanics ever more stringently, or just to explore the curious implications of
non-linearity. Such extensive theoretical investigations have prompted several
increasingly precise experimental tests which have confirmed, at least for
the time being, the robustness of quantum mechanics.

In this work we propose an approach to quantum theory using ideas inspired
by Cox's, although in a very different context. The result is the standard
quantum theory. The crux of our argument is also a consistency requirement: if
a probability amplitude can be computed in two different ways, the two answers
must agree. This requirement is expressed in the form of functional equations,
the solution of which leads to the usual sum and product rules for quantum
probability amplitudes. In other words, quantum theory emerges as the unique
way to manipulate probability amplitudes consistently.

Next we obtain two important consequences. The first is that the equation
for time evolution, the Schrödinger equation, is necessarily linear. The
implication is that the question of whether non-linear versions of quantum
mechanics are at all possible should not be posed at the dynamical level of the
Schrödinger equation but rather at a much deeper kinematical level requiring a
reexamination of the use and utility of the concept of amplitude.

The second result addresses the issue of how knowledge of the numerical
value of an amplitude assists us in predicting the outcomes of experiments.
This question of the physical interpretation of an otherwise abstract
formalism is handled by proposing a very natural general rule which applies to 
situations in which the result of an experiment is predicted with certainty. Using 
this rule, which involves no probabilities, we obtain a proof of Born's statistical 
postulate. The implication here is that a quantum theory formulated in terms 
of consistently assigned amplitudes must be indeterministic. 

These two results, the proof of linearity and of Born's probability
interpretation, are not new. They have been anticipated within various axiomatic
approaches to quantum mechanics. For example, the fact that Born's postulate
is actually a theorem was independently discovered long ago by Gleason,
by Finkelstein, by Hartle and by Graham. What is new here is the
manner in which the results are obtained; the new element is the emphasis on
consistency in a formulation where amplitudes, rather than states or
observables, are the central concept. From the pedagogical point of view there
is an advantage in sharing the intuitive appeal of Feynman's path integrals.
Axiomatic methods, on the other hand, tend to be considerably more abstract and
mathematically sophisticated; this is not in itself a defect but it does hinder
their accessibility.

To limit the risk that too general a treatment might obscure the simplicity 
of the main ideas we will focus our attention on a simple example: a particle
with no spin or other internal structure; its only attribute is its position. Fur- 
thermore, to avoid distractions with mathematical technicalities (which might, 
in other contexts, be very relevant) we will restrict the positions of the particle 
to sites on a discrete lattice. We emphasize that these simplifications are not 
necessary. The generalization to other systems involving more complicated con- 
figuration spaces is straightforward. If one wants to describe a particle moving 
in a continuum the modifications are rather trivial, a mere replacement of sums 
by integrals; the case of a quantum field theory might not be as easy, but in 
principle it should be doable as well. 

In section 2 we consider various idealized experimental setups which will test 
whether a particle moves from an initial starting point to a final destination 
point. The use of these setups defines what statements or propositions about 
the particle we are allowed to make. No mention of observables beyond position 
is ever made, but there is a possibility of combining simple setups into more 
complex ones. This is described by introducing two operations, which we call 
and and or, that allow us to construct complex setups (or propositions) from 
simpler ones. 

At first sight this approach to quantum theory might resemble other
axiomatic approaches. For example, in the quantum logic approach
propositions are also defined operationally in terms of the setups that will test
them. But there are major differences. For example, an operation of central
importance in quantum logic is that of negating a proposition; in our
approach negation is never introduced. A comparison with the complex probability
approach also shows a similarity which on further analysis is, again, proved
superficial. Unlike the latter theory, our and and or operations are not the usual
Boolean ones, although they do enjoy a sufficient measure of associativity and
distributivity to justify their names. In fact, the set of statements allowed here
is much more restricted than in either of the two approaches mentioned.

In section 3 we seek a quantitative representation of the and and or opera- 
tions, i.e., a representation of the possible relations among various experimental 
setups. This is done by assigning a complex number to each setup in such a 
way that relations among setups translate into relations among the correspond- 
ing complex numbers. The assignment is highly constrained by a consistency 
requirement, expressed in terms of functional equations, that if the complex 
number associated to a given setup can be computed in more than one way, the 
various answers must agree. Solving the consistency constraints shows that all 
representations of and and or are actually equivalent to each other and that 
there is a particular choice that is singled out by its convenience. With this 
choice the and and or operations are represented by the product and sum of
complex numbers, i.e., the product and sum of probability amplitudes.

After introducing, in section 4, the concept of a state described by a wave
function, we show how the product and sum rules imply the linearity of the
Schrödinger equation. Then we address the issue of the physical interpretation
of the formalism, that is, of how probability amplitudes are to be used in the
prediction of outcomes of experiments and, in sections 5 and 6, we give a proof
of Born's statistical postulate. Final comments appear in section 7.

2 What can we say about a simple particle? 

Suppose the only experiments we can perform are those that can detect the
presence or absence of the particle in a sufficiently small region of space-time
around an event $x = (\vec{x}, t)$. Later, in section 6, we will argue that this
is not as restrictive as one might at first think. The propositions in which we
are interested will typically describe motion. The simplest statement of this
sort, "the particle moves from $x_i$ to $x_f$," which we will denote by
$[x_f, x_i]$, can be tested by preparing a particle at $x_i$ and placing a
detector at $x_f$. Our eventual goal is that of predicting the likelihood of a
positive outcome of a test of $[x_f, x_i]$.

To analyze this problem we consider placing various obstacles in the path of
the particle. This leads to propositions involving various constraints
intermediate between $x_i$ and $x_f$. Consider, for example, "the particle goes
from $x_i$ to $x_f$ via the intermediate point $x_1$" (we assume that
$t_i < t_1 < t_f$) or, equivalently, "the particle goes from $x_i$ to $x_1$ and
from there to $x_f$". This we will denote by $[x_f, x_1, x_i]$. How could we
test this proposition? We will certainly have to prepare the particle at the
starting point $x_i$ and place a detector at the final destination point $x_f$,
but we cannot place a second detector at $x_1$: our particle is a delicate
microscopic object, and our detectors are clumsy macroscopic devices that will
totally alter the nature of the motion. All detections should be kept to the
bare minimum: just one detection at the final destination point.

To carry out a test of $[x_f, x_1, x_i]$ we will imagine an experimental setup
with a source at $x_i$, a detector at the final destination $x_f$ and some sort
of device which implements the constraint at $x_1$. Needless to say, we deal
here with a highly idealized conceptual device used as an aid for reasoning
rather than for actual experimentation; such devices are not unusual in
theoretical physics. We imagine first an extended obstacle which blocks all
paths that the particle could have taken through some arbitrary spacetime region
(see fig. 1a). This already represents a considerable idealization; a more
realistic obstacle would have fuzzy edges, regions of partial transparency
rather than total opacity, and so on, but let us nevertheless proceed. The
complications due to the arbitrary shape can be alleviated by imagining our
obstacle as a succession of simpler obstacles each operating at a single time
(see fig. 1b). The next step in idealization, shown in fig. 1c, is one of these
single-time obstacles of infinitesimal spatial extent: it blocks all the paths
passing through the spacetime point $x_1$.

In fig. 2a and 2b we show the setups needed to test $[x_f, x_i]$ and
$[x_f, x_1, x_i]$.

The obstacle that implements the constraint at $x_1$ is the complement of the
infinitesimal obstacle of fig. 1c. This idealized device we will call a "filter." It
suddenly appears at time $t_1$, blocking the particle everywhere in space except
for a small "hole" around the point $x_1$, through which the particle may pass
undisturbed. The filter lasts an infinitesimally short interval and then, just as
suddenly, it disappears. The net result is that the filter prevents any motion
from $x_i$ to $x_f$ except via the intermediate point $x_1$. If we want to impose
more constraints, as in $[x_f, x_2, x_1, x_i]$ = "the particle goes from $x_i$
to $x_f$ via the intermediate events $x_1$ and $x_2$," we introduce two filters,
one at time $t_1$ and another at $t_2$ with holes at $x_1$ and $x_2$
respectively. The use of an infinite number of filters would allow us to specify
completely the path followed by the particle.

Figure 1: (a) A generic obstacle of irregular shape and duration is placed in
the path of the particle from $x_i$ to $x_f$. (b) The obstacle can be envisioned
as a succession of idealized obstacles of irregular shapes but essentially no
duration. (c) An obstacle of infinitesimal extent; it blocks all the paths
passing through a given spacetime point.

The conceptual device of using these idealized filters allows us to introduce
yet another kind of setup or proposition. Suppose that instead of having one
hole in the filter at $t_1$ we open two holes, one at $x_1$ and another at
$x_1'$ (see fig. 2c). This physical situation is one which one might classically
describe as "the particle goes from $x_i$ to $x_f$ via point $x_1$ or $x_1'$."
Such a proposition we will denote by $[x_f, x_1, x_1', x_i]$. (We will generally
use subscripts to label the times at which events or filters occur and
superscripts or primes to distinguish events or holes which happen at the same
time but at different locations.) Although it is not quite necessary, for the
sake of clarity, we may wish to write $[x_f, (x_1, x_1'), x_i]$ where we have
grouped together events which, being simultaneous, represent holes in the same
filter.

The propositions we will consider will all be of the general form

$$a = [x_f, s_N, s_{N-1}, \ldots, s_2, s_1, x_i], \qquad (1)$$

where $s_n = (x_n, x_n', x_n'', \ldots)$ denotes a filter at time $t_n$,
intermediate between $t_i$ and $t_f$, with holes at $x_n, x_n', x_n'', \ldots$
Statements such as $[s_N, s_{N-1}, \ldots, s_2, s_1, x_i]$ or
$[x_f, s_N, s_{N-1}, \ldots, s_2, s_1]$ are not allowed. Two propositions will
be considered equal when they represent the same experimental setup, i.e., the
same distribution of filters and holes.

Figure 2: Examples of simple propositions: (a) "the particle moves from $x_i$ to
$x_f$", (b) "the particle moves from $x_i$ to $x_1$ and from there to $x_f$",
and (c) "the particle goes from $x_i$ to $x_f$ via $x_1$ or $x_1'$".

Notice that equation (1) incorporates two crucial features of quantum theory:
First, the allowed setups involve a single initial and a single final event where we
can place a source and a detector. This is a recognition that measurements
and other interactions with macroscopic devices that induce uncontrollable
disturbances must be avoided. Second, there is a one to one correspondence
between the allowed statements and the idealized experimental setups with which
we could test those propositions: all propositions are testable. In fact, we are
identifying propositions with setups; this is a recognition that no statements can
be made about the particle by itself independently of the experimental context.

From now on the words 'proposition' and 'setup' will be used interchangeably.
In fact, we prefer the latter and will use it more often because, first, its
use helps emphasize that the goal is to find out whether the detector at $x_f$
will fire or not. Second, by avoiding statements about the particle itself we hope
to eliminate misconceptions about what the particle is and what it is actually
doing between source and detector. We are not saying that the particle is either
a point particle or a wave, or both, or neither. We are not saying that it went
through either one hole or through another, or even that it went through both
holes at the same time. In fact, beyond the fact that the particle is capable of
being emitted and detected we are not assuming much at all.

In attempting to predict the result of tests it seems reasonable to assume that 
if two propositions are related in some way (one proposition might, for example, 
be testable using a part of the setup used for the other), then information about 
one should be relevant to predictions about the other. Our next step will be to 
exhibit relations of this sort. This will allow us to use simple setups to build 
more complex ones, and conversely, also to analyze complex setups into simpler 
ones. 

The basic relations that we wish to establish are of two kinds. The first kind
arises when two setups $a$ and $b$ can be placed in immediate succession. This
results in a third setup, obviously "related" to the first two, which we will
denote by $ab$. Notice that this operation, which we will call and, cannot be
used to combine any two arbitrarily chosen setups $a$ and $b$. It is only when
the destination point of the earlier setup coincides with the source point of
the later setup that the combined $ab$ is an allowed setup; $a$ and $b$ must be
consecutive. The simplest instance of this is

$$[x_f, x_1][x_1, x_i] = [x_f, x_1, x_i], \qquad (2)$$

and another example is shown in fig. 3a. In general,

$$[x_f, s_N, \ldots, x_n][x_n, s_{n-1}, \ldots, s_1, x_i] = [x_f, s_N, \ldots, x_n, \ldots, s_1, x_i]. \qquad (3)$$

Conversely, any proposition with a filter containing a single hole can be
decomposed into two consecutive propositions. In the left member of equation (3)
it is important that all $t_1, \ldots, t_{n-1}$ happen before $t_n$ and that
$t_{n+1}, \ldots, t_N$ happen after $t_n$, otherwise the two setups overlap and
are not consecutive.

The second useful kind of relation we consider arises when two setups $a'$
and $a''$ are identical except on one single filter where none of the holes of
$a'$ overlap any of the holes of $a''$. We may then form a third setup $a$,
denoted by $a' \vee a''$, which includes the holes of both $a'$ and $a''$. A
simple instance of this operation, which we will call or, occurs in a "two-slit"
experiment (see fig. 2c),

$$[x_f, x_1, x_i] \vee [x_f, x_1', x_i] = [x_f, (x_1, x_1'), x_i], \qquad (4)$$

and another example is shown in fig. 3b. The general case is

$$[x_f, \ldots, s_n', \ldots, x_i] \vee [x_f, \ldots, s_n'', \ldots, x_i] = [x_f, \ldots, s_n, \ldots, x_i], \qquad (5)$$

where the filter $s_n$ contains the holes of both $s_n'$ and $s_n''$.

Figure 3: Two examples of using and and or to construct complex propositions
out of simpler ones.

Again, notice that it is only for special choices of setups $a$ and $b$ that
this or operation will result in an allowed setup $a \vee b$.

The two symbols we have introduced, and and or, represent our presumed 
ability to construct more complex setups out of simpler ones. If one considers 
them as operations it is natural to ask if there are any rules that should be 
followed to manipulate them consistently. To obtain these rules we follow the 
principle, mentioned earlier, that two propositions are equal when they represent 
the same setup of filters and holes. The first rule is that the or operation is
commutative,

$$a \vee b = b \vee a. \qquad (6)$$

For the and operation, however, there is an asymmetry implicit in the idea of
placing setups in succession: one setup is the earlier one. It is convenient to
incorporate this feature into the notation: if $ab$ is an allowed setup,

$$ab \neq ba, \qquad (7)$$

because $ba$ is not allowed.

Next, we can see that both and and or enjoy a certain amount of
associativity. For example, given three consecutive setups $a$, $b$ and $c$ we
can write

$$(ab)c = a(bc) = abc, \qquad (8)$$

provided $ab$ and $bc$ are allowed. In this case $(ab)c$ and $a(bc)$ are
automatically allowed but $ac$ is not because $a$ and $c$ are not consecutive.
Similarly, for the or operation we have

$$(a \vee b) \vee c = a \vee (b \vee c) = a \vee b \vee c, \qquad (9)$$

provided all four setups $(a \vee b)$, $(b \vee c)$, $(a \vee b) \vee c$ and
$a \vee (b \vee c)$ are allowed. In this case eq. (9) is also equal to
$(a \vee c) \vee b$. Notice that any differences between the three setups $a$,
$b$, and $c$ must be found in one and the same filter. Otherwise
$(a \vee b) \vee c \neq a \vee (b \vee c)$ because if the member on the left is
allowed the one on the right is not.

The last important rule is that of distributivity. This may take the form

$$a(b \vee c) = (ab) \vee (ac) \quad \text{or} \quad (b \vee c)a = (ba) \vee (ca). \qquad (10)$$

Which equality holds, if any, will depend on whether the relevant propositions
are allowed. Both equalities cannot hold simultaneously.
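
The rules of this section can be made concrete with a small data model. The
following is only a sketch under stated assumptions, not part of the original
argument: events are taken to be integer (site, time) pairs, a filter is stored
as a time together with its set of open holes, and the names Setup, and_ and
or_ are hypothetical. The functions return None when the combined setup is not
allowed, and they do not check that filter times lie between source and
detector.

```python
from dataclasses import dataclass
from typing import FrozenSet, Optional, Tuple

Event = Tuple[int, int]               # (lattice site, time); an assumed encoding
Filter = Tuple[int, FrozenSet[int]]   # (time, set of sites left open as holes)


@dataclass(frozen=True)
class Setup:
    """The proposition [x_f, s_N, ..., s_1, x_i]; filters are stored earliest first."""
    source: Event
    filters: Tuple[Filter, ...]
    detector: Event


def and_(later: Setup, earlier: Setup) -> Optional[Setup]:
    """Return the setup `later earlier` of eq. (3), or None if not consecutive."""
    if earlier.detector != later.source:
        return None
    site, time = later.source
    single_hole: Filter = (time, frozenset({site}))
    filters = earlier.filters + (single_hole,) + later.filters
    return Setup(earlier.source, filters, later.detector)


def or_(a: Setup, b: Setup) -> Optional[Setup]:
    """Return `a or b` of eq. (5), or None if the combination is not allowed."""
    if (a.source, a.detector) != (b.source, b.detector):
        return None
    if len(a.filters) != len(b.filters):
        return None
    differing = [k for k, (fa, fb) in enumerate(zip(a.filters, b.filters)) if fa != fb]
    if len(differing) != 1:           # must differ on exactly one filter
        return None
    k = differing[0]
    (time_a, holes_a), (time_b, holes_b) = a.filters[k], b.filters[k]
    if time_a != time_b or holes_a & holes_b:   # same time, non-overlapping holes
        return None
    merged = a.filters[:k] + ((time_a, holes_a | holes_b),) + a.filters[k + 1:]
    return Setup(a.source, merged, a.detector)


# Hypothetical events used only for illustration.
x_i, x_1, x_1p, x_f = (0, 0), (1, 1), (2, 1), (0, 2)
via_x1 = and_(Setup(x_1, (), x_f), Setup(x_i, (), x_1))      # eq. (2)
via_x1p = and_(Setup(x_1p, (), x_f), Setup(x_i, (), x_1p))
two_slit = or_(via_x1, via_x1p)                               # eq. (4)
```

In this model or_ is symmetric in its arguments while and_ is not, and
and_(later, two_slit) coincides with the or_ of the two separately extended
setups, which is one instance of the left equality in eq. (10).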

An important illustration of the use and utility of the and and the or
operations arises from the observation that a single filter that is totally
covered with holes is equivalent to having no filter at all. In other words, the
absence of a filter at time $t_1$ is a special kind of filter $\sigma_1$ which
may be freely introduced into any proposition (provided $t_i < t_1 < t_f$). For
example, using eqs. (3) and (5), we have

$$[x_f, x_i] = [x_f, \sigma_1, x_i] = \bigvee_{\text{all } x_1} [x_f, x_1, x_i] = \bigvee_{\text{all } x_1} \left( [x_f, x_1][x_1, x_i] \right), \qquad (11)$$

and, introducing additional $\sigma$ filters at times $t_2, \ldots, t_N$, we get

$$[x_f, x_i] = \bigvee_{\text{all } x_N} \cdots \bigvee_{\text{all } x_2} \bigvee_{\text{all } x_1} \left( [x_f, x_N] \cdots [x_2, x_1][x_1, x_i] \right). \qquad (12)$$

This shows how motion over a long distance can be analyzed in terms of motion
over shorter steps.

One cannot fail to see some similarity between our quantum and and or
operations and the Boolean operations and and or, which also happen to be
commutative, associative and distributive and are also used to construct more
complex propositions out of simpler ones. But the similarity ends there: The
quantum and and or introduced here are not logical but rather physical
connectives; they are used to describe the relative dispositions of various
pieces of equipment. Also, and perhaps more important, is the fact that the
Boolean operations (and this applies as well to the and and or introduced in
quantum logics) can connect any two arbitrary propositions, while the conditions
on allowed setups impose severe restrictions on the action of the quantum and
and or.

3 Amplitudes: the sum and product rules 

Our goal is to predict the outcomes of experiments and the strategy is to es- 
tablish a network of relations among setups in the hope that information about 
some setups might be helpful in making predictions about others. Our next step 
will be to obtain a quantitative representation of these relations. 

Suppose each setup $a$ is assigned a complex number $\phi(a)$. By a
'representation' we mean that the assignment of $\phi$s is such that relations
among physical setups should translate into relations among the complex numbers
associated to them. Why should such a representation exist? It need not, but all
of physics consists of representing elements of reality, or relations among
these elements, or our information about them, by mathematical objects. The
existence of such representations may be mysterious, but it is not surprising;
there are too many examples. A second, simpler question is why we seek a
representation in terms of complex numbers. Again, no answer here; this is an
unexplained feature of quantum theory. It seems that a single complex number is
sufficient to convey the physically relevant information about a setup.

To be specific consider a "double-slit" experiment. The relation between
$[x_f, (x_1, x_1'), x_i]$ and its components $[x_f, x_1, x_i]$ and
$[x_f, x_1', x_i]$ will be represented as a relation between the complex numbers
$\phi(x_f, (x_1, x_1'), x_i)$, $\phi(x_f, x_1, x_i)$ and $\phi(x_f, x_1', x_i)$
corresponding to them. What we require is that there exist a function $S$ such
that

$$\phi(x_f, (x_1, x_1'), x_i) = S\left( \phi(x_f, x_1, x_i), \phi(x_f, x_1', x_i) \right), \qquad (13)$$

and that this same function apply to any other setups that are similarly related.
More generally, if the setups associated to $a$ and to $a'$ are such that
$a \vee a'$ is an allowed setup then

$$\phi(a \vee a') = S\left( \phi(a), \phi(a') \right). \qquad (14)$$

Thus, the function $S$ is a representation of the relation or.

The requirement that $S$ should exist is a strong constraint on the allowed
assignment of $\phi$s. Consider for example the number $\phi$ assigned to
$a \vee a' \vee a''$. Using associativity this can be calculated in two
different ways, either as $\phi((a \vee a') \vee a'')$ or as
$\phi(a \vee (a' \vee a''))$. Consistency requires that the two ways agree,

$$S\left( \phi(a \vee a'), \phi(a'') \right) = S\left( \phi(a), \phi(a' \vee a'') \right). \qquad (15)$$

Using $S$ once again one obtains the following consistency constraint

$$S(S(u, v), w) = S(u, S(v, w)), \qquad (16)$$

where we have let $\phi(a) = u$, $\phi(a') = v$, and $\phi(a'') = w$.

One can check, by substitution, that the associativity constraint, eq. (16), is
satisfied if

$$S(u, v) = \xi^{-1}\left( \xi(u) + \xi(v) \right) \quad \text{or} \quad \xi(S(u, v)) = \xi(u) + \xi(v), \qquad (17)$$

where $\xi$ is an arbitrary function. In the appendix we give a proof (similar to
Cox's) that this is also the general solution. In other words, eq. (17) tells us
what forms the function $S$ may take, and conversely, that if the function $S$
exists then there must also exist another function $\xi$, calculable from $S$,
such that

$$\xi(\phi(a \vee a')) = \xi(\phi(a)) + \xi(\phi(a')). \qquad (18)$$

This is remarkable. It immediately suggests that instead of the original
representation in terms of the complex numbers $\phi(a)$, we should opt for an
equivalent, simpler and more convenient representation in terms of the numbers
$\xi(\phi(a))$. In other words, the consistent assignment of complex numbers
$\xi(a)$ to propositions $a$ can always be done so that the or operation is
represented by a simple sum rule,

$$\xi(a \vee a') = \xi(a) + \xi(a'). \qquad (19)$$

In this representation $S$ is addition.
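
As a numerical illustration of eq. (17), the sketch below builds S from a chosen
invertible function and checks the associativity constraint, eq. (16). The two
choices of function are assumptions made only for illustration: the inverse
hyperbolic tangent on real numbers with magnitude below one (for which S is the
familiar formula (u + v)/(1 + uv)), and an affine map on complex numbers. The
names make_S, S_tanh and S_affine are hypothetical.

```python
import math


def make_S(xi, xi_inverse):
    """Build S(u, v) = xi^{-1}(xi(u) + xi(v)), the form given in eq. (17)."""
    def S(u, v):
        return xi_inverse(xi(u) + xi(v))
    return S


# Illustrative choice 1 (real numbers in (-1, 1)): xi = atanh.
S_tanh = make_S(math.atanh, math.tanh)


# Illustrative choice 2 (complex numbers): an affine xi and its inverse.
def xi_affine(u):
    return 2 * u + 1


def xi_affine_inverse(y):
    return (y - 1) / 2


S_affine = make_S(xi_affine, xi_affine_inverse)


def is_associative(S, u, v, w, tol=1e-12):
    """Check eq. (16), S(S(u, v), w) = S(u, S(v, w)), up to rounding error."""
    return abs(S(S(u, v), w) - S(u, S(v, w))) < tol
```

In both cases applying the chosen function to S(u, v) gives the plain sum of the
transformed arguments, which is the change of variables that turns a general S
into addition.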

Next we turn our attention to the representation of the and operation. From
this point onward Cox's treatment and ours differ. Cox focused on the
operation of negating a proposition, and was thus led to consider the consistency
requirement ensuing from the possibility of double negation. Negation is not
an operation available to us; we choose instead to concentrate on the associative
and distributive properties of and.

Consider, for example, a particle that goes from an initial $x_i$ to a final
$x_f$ via an intermediate point $x$. We want to represent the relation between
$[x_f, x, x_i]$ and its components $[x_f, x]$ and $[x, x_i]$ as a relation
between the complex numbers $\xi(x_f, x, x_i)$, $\xi(x_f, x)$ and
$\xi(x, x_i)$. We then require that there exist a function $P$ such that

$$\xi(x_f, x, x_i) = P\left( \xi(x_f, x), \xi(x, x_i) \right), \qquad (20)$$

and that the same function $P$ apply to any other propositions that are similarly
related. Specifically, if $ab$ is any allowed proposition we require that

$$\xi(ab) = P\left( \xi(a), \xi(b) \right), \qquad (21)$$

so that the function $P$ is a representation of the and operation.

The functional form of $P$ is highly constrained by the requirement that $P$
should exist for any $a$ and $b$ such that $ab$ is allowed. We can repeat the
argument we used earlier for the or operation: the number $\xi$ associated to
the (allowed) proposition $abc$ can be computed in two ways, either as
$\xi((ab)c)$ or as $\xi(a(bc))$, and these should agree. Therefore, $P$ must
satisfy the associativity constraint

$$P(P(u, v), w) = P(u, P(v, w)). \qquad (22)$$






Furthermore, and and or are not unrelated: using distributivity, the number
$\xi$ associated to the (allowed) proposition $a(b \vee c)$ can also be computed
in two ways, either as $\xi(a(b \vee c))$ or as $\xi((ab) \vee (ac))$.
Therefore, using eq. (19),

$$P\left( \xi(a), \xi(b \vee c) \right) = \xi(ab) + \xi(ac), \qquad (23)$$

and, using $P$ and $S$ once again, we conclude that left distributivity leads to
the following constraint

$$P(u, v + w) = P(u, v) + P(u, w), \qquad (24)$$

where $\xi(a) = u$, $\xi(b) = v$, and $\xi(c) = w$. Similarly, from propositions
of the form $(a \vee b)c$ for which right distributivity holds we obtain

$$P(u + v, w) = P(u, w) + P(v, w). \qquad (25)$$

The solution of these distributivity constraints is trivial. Differentiating
eq. (24) with respect to $v$ and $w$ and letting $v + w = z$ gives

$$\frac{\partial^2}{\partial z^2} P(u, z) = 0, \qquad (26)$$

so that $P$ is linear in its second argument, $P(u, v) = A(u)v + B(u)$.
Substituting back into eq. (24) gives $B(u) = 0$. Similarly, eq. (25) implies
that $P$ is linear in its first argument, therefore

$$P(u, v) = Cuv \quad \text{or} \quad \xi(ab) = C\,\xi(a)\,\xi(b). \qquad (27)$$

The associativity constraint, eq. (22), is automatically satisfied. The constant
$C$ can be absorbed into yet a new number $\psi(a) = C\xi(a)$, so that the and
operation can be conveniently represented by a simple product rule,

$$\psi(ab) = \psi(a)\,\psi(b), \qquad (28)$$

while the sum rule remains unaffected,

$$\psi(a \vee b) = \psi(a) + \psi(b). \qquad (29)$$

Complex numbers assigned in this particularly convenient way will be called
"amplitudes".

Let us summarize the results of this section: A quantitative representation
of the relations between setups can be obtained by assigning a complex number
$\psi(a)$ to each proposition $a$. Because of the crucial requirement of
consistency the considerable arbitrariness in the actual choice of $\psi(a)$ is
largely illusory; it turns out that all representations are equivalent to each
other, i.e., they are obtained from each other by mere "changes of variables."
Although all consistent assignments are equally correct in the sense that they
serve our purpose of providing the desired representation, some are singled out
by their sheer convenience. They lead to representations of the quantum and and
the or operations that take very simple forms: products and sums. This is the
central result of this paper.
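
The solution P(u, v) = Cuv and the rescaling that absorbs the constant C can be
checked numerically. The sketch below is illustrative only: the value of C, the
sample arguments and the function names are arbitrary assumptions. It tests the
associativity and distributivity constraints of this section for P, and it
includes a deliberately wrong candidate, P_bad, that fails left distributivity.

```python
C = 0.5 - 2j   # arbitrary nonzero constant, chosen only for illustration


def P(u, v):
    """Candidate representation of the and operation: P(u, v) = C u v."""
    return C * u * v


def P_bad(u, v):
    """A candidate that is not bilinear; it violates left distributivity."""
    return u * v + 1


def psi_from_xi(xi_value):
    """Rescaled number psi = C xi that absorbs the constant C."""
    return C * xi_value


def close(x, y, tol=1e-12):
    return abs(x - y) < tol


def satisfies_constraints(P_candidate, u, v, w):
    """Check associativity and both distributivity constraints at one point."""
    associative = close(P_candidate(P_candidate(u, v), w),
                        P_candidate(u, P_candidate(v, w)))
    left = close(P_candidate(u, v + w), P_candidate(u, v) + P_candidate(u, w))
    right = close(P_candidate(u + v, w), P_candidate(u, w) + P_candidate(v, w))
    return associative and left and right


# Sample values standing in for xi(a), xi(b), xi(c).
u, v, w = 1 + 1j, 2 - 0.5j, -0.3 + 0.7j
```

With these definitions psi_from_xi(P(u, v)) equals psi_from_xi(u) times
psi_from_xi(v), and psi_from_xi(u + v) equals the sum of the rescaled numbers,
which are the product and sum rules for amplitudes.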

4 Wave functions and the linearity of time evolution

Amplitudes have been introduced in the last section as the natural concept
to describe experiments quantitatively but we have not yet indicated how the
knowledge of an amplitude is to be used in predicting the outcomes of
experiments. In order to suggest, in the following section, how amplitudes are to
be interpreted, we will first explore, along conventional lines, the properties
of the amplitude $\psi(x_f, x_i)$ associated to the basic proposition
$[x_f, x_i]$.

We had seen earlier, in eq. (11), how to analyze a motion from $x_i$ to $x_f$
in terms of motion over shorter steps from $x_i$ to $x$ and from there to $x_f$.
Now we can express this in terms of probability amplitudes; using the sum and
product rules, we get

$$\psi(x_f, x_i) = \sum_{\text{all } x \text{ at } t} \psi(x_f, x, x_i) = \sum_{\text{all } x \text{ at } t} \psi(x_f, x)\,\psi(x, x_i). \qquad (30)$$

The sums are a consequence of restricting the positions $\vec{x}$ to sites on a
discrete lattice; the generalization to a more realistic continuum where the
sums are replaced by integrals is straightforward. Equation (30) or, more
explicitly,

$$\psi(\vec{x}_f, t_f; \vec{x}_i, t_i) = \sum_{\text{all } \vec{x} \text{ at } t} \psi(\vec{x}_f, t_f; \vec{x}, t)\,\psi(\vec{x}, t; \vec{x}_i, t_i), \qquad (31)$$

describes time evolution and therefore holds the key to the question of the
physical interpretation. To see this it is convenient to introduce the notion of a
state described by a wave function.
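
On a finite lattice the composition law of eq. (30) has the structure of a
matrix product. The sketch below is only an illustration under assumptions: the
lattice has two sites, the amplitude tables are invented numbers with no
physical significance, and the name compose is hypothetical. An entry
table[f][i] stands for the amplitude to go from site i at the earlier time to
site f at the later time.

```python
def compose(later, earlier):
    """Eq. (30): psi(f, i) = sum over intermediate sites x of psi(f, x) psi(x, i)."""
    n_mid = len(earlier)          # number of intermediate lattice sites
    n_initial = len(earlier[0])
    return [[sum(later[f][x] * earlier[x][i] for x in range(n_mid))
             for i in range(n_initial)]
            for f in range(len(later))]


# Invented amplitude tables on a two-site lattice, for illustration only.
psi_t1_t0 = [[1, 1j],
             [1j, 1]]             # from time t0 to time t1
psi_t2_t1 = [[0.5, 0],
             [0, 0.5j]]           # from time t1 to time t2
psi_t3_t2 = [[1, -1],
             [1, 1]]              # from time t2 to time t3

psi_t2_t0 = compose(psi_t2_t1, psi_t1_t0)
```

Because the product rule is associative, composing three steps gives the same
table whichever pair of steps is combined first.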

Suppose that a particle starts at $(x_i, t_i)$ and prior to time $t$ it
undergoes various interactions, the net result of which is that the amplitude
to reach the point $x$ at time $t$ is given by $\Psi(x, t)$. Of course,
$\Psi(x, t)$ is numerically equal to $\psi(x, t; x_i, t_i)$, but situations are
common where we know $\Psi(x, t)$ and either have no interest in, or have lost
track of, what happened before $t$. In these cases we may streamline the
notation and replace $\psi(x, t; x_i, t_i)$ by $\Psi(x, t)$. Using this
knowledge of $\Psi(x, t)$ we can calculate $\psi(x_f, x_i)$ directly. From
eq. (30),

$$\psi(x_f, x_i) = \Psi(x_f, t_f) = \sum_{\text{all } x \text{ at } t} \psi(x_f, t_f; x, t)\,\Psi(x, t). \tag{32}$$

The function $\Psi(x, t)$, called the wave function, represents all those
features of the interactions previous to time $t$ that are relevant to the
prediction of evolution after $t$. We might, by abuse of (classical) language,
say that $\Psi$ describes the state of the particle at time $t$, and that the
effect of those interactions prior to $t$ has been to prepare the particle in
state $\Psi$.
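As a minimal sketch of how the composition of amplitudes and the propagation of a wave function fit together, assume a lattice of three sites and arbitrary illustrative amplitudes. The helper names `compose` and `propagate` and all numbers below are hypothetical and are not taken from the paper. Summing over intermediate positions is a matrix product, and propagating a wave function in two steps agrees with a single step that uses the composite amplitude.

```python
# Sketch: amplitudes on a 3-site lattice stored as nested lists.
# K[xf][x] stands for psi(xf, t_f; x, t). All numbers are made up.

def compose(K2, K1):
    """Sum over all intermediate sites x of K2[xf][x] * K1[x][xi]."""
    n = len(K1)
    return [[sum(K2[f][x] * K1[x][i] for x in range(n)) for i in range(n)]
            for f in range(n)]

def propagate(K, Psi):
    """Psi(xf, t_f) = sum over x of K[xf][x] * Psi[x]."""
    return [sum(K[f][x] * Psi[x] for x in range(len(Psi)))
            for f in range(len(K))]

K1 = [[0.5 + 0.1j, 0.2j, 0.0],
      [0.1, 0.6 - 0.2j, 0.3j],
      [0.0, 0.2, 0.7 + 0.1j]]
K2 = [[0.4j, 0.3, 0.1],
      [0.2 - 0.1j, 0.5, 0.2j],
      [0.1, 0.1j, 0.6]]
Psi_t = [1.0 + 0j, 0.5j, -0.25]

# Two short steps versus one step with the composite amplitude.
two_steps = propagate(K2, propagate(K1, Psi_t))
one_step = propagate(compose(K2, K1), Psi_t)
```

The two results agree up to rounding, which is the consistency requirement at work: either route to the final amplitude must give the same answer.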

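Let's return to the description of time evolution implicit in eq. (32).
Differentiating with respect to $t_f$ and evaluating at $t_f = t$ we get

$$\frac{\partial \Psi(x_f, t)}{\partial t} = \sum_{\text{all } x \text{ at } t} \left. \frac{\partial \psi(x_f, t'; x, t)}{\partial t'} \right|_{t'=t} \Psi(x, t).$$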



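The derivative on the right is a function of $x_f$, $x$, and $t$. If we define

$$\left. \frac{\partial \psi(x_f, t'; x, t)}{\partial t'} \right|_{t'=t} = \frac{1}{i\hbar} H(x_f, x, t),$$

then

$$i\hbar \frac{\partial \Psi(x_f, t)}{\partial t} = \sum_{\text{all } x \text{ at } t} H(x_f, x, t)\,\Psi(x, t), \tag{33}$$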

which is recognized as the Schrödinger equation. We might not yet know what
$\Psi$ means, nor what the Hamiltonian $H$ should be, but we have obtained an
important result: once certain natural consistency requirements are accepted,
the time evolution of quantum states is given by a Schrödinger equation, which
is necessarily a linear equation.
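The linearity can be illustrated with a small numerical sketch. It assumes units with $\hbar = 1$, a hypothetical nearest-neighbour hopping Hamiltonian on three sites, and a single first-order (Euler) time step; none of these choices comes from the paper. Evolving a superposition gives the same result as superposing the separately evolved wave functions.

```python
HBAR = 1.0  # assumption of this sketch: units with hbar = 1

def short_time_step(H, Psi, dt):
    """One Euler step of the discrete Schrodinger equation:
    Psi(t + dt) = Psi(t) + (dt / (i * hbar)) * sum_x H[xf][x] * Psi[x]."""
    n = len(Psi)
    return [Psi[f] + (dt / (1j * HBAR)) * sum(H[f][x] * Psi[x] for x in range(n))
            for f in range(n)]

# Hypothetical hopping Hamiltonian on a 3-site lattice.
H = [[0.0, -1.0, 0.0],
     [-1.0, 0.0, -1.0],
     [0.0, -1.0, 0.0]]

Psi1 = [1.0 + 0j, 0.0, 0.0]
Psi2 = [0.0, 0.0, 1.0 + 0j]
a, b = 0.6, 0.8j
dt = 0.01

# Evolve the superposition a*Psi1 + b*Psi2 ...
combined = [a * p + b * q for p, q in zip(Psi1, Psi2)]
lhs = short_time_step(H, combined, dt)

# ... and compare with the superposition of the evolved wave functions.
rhs = [a * p + b * q
       for p, q in zip(short_time_step(H, Psi1, dt),
                       short_time_step(H, Psi2, dt))]
```

Any update rule built from sums of products of amplitudes has this property; a non-linear term in the update would break the agreement between `lhs` and `rhs`.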

The conclusion is clear: Non-linear variants of quantum mechanics that 
preserve the notion of amplitudes violate natural requirements of consistency. 
The question of whether it is possible to formulate non-linear versions of quan- 
tum mechanics should not be formulated as a dynamical question about which 
non-linear terms one is allowed to add to the Schrödinger equation, but rather
it should be phrased as a deeper kinematical question about whether quan- 
tum mechanics should be formulated in terms of mathematical objects other 
than amplitudes and wave functions. However, whatever the nature of those 
mathematical objects the requirement that they be manipulated in a consistent 
manner should be maintained. 



5 Physical interpretation: Born's statistical postulate

Having established rules for the consistent manipulation of amplitudes we can,
finally, address the question of how to use them to predict the outcomes of
experiments. The key to finding a physical interpretation for wave functions or,
equivalently, for amplitudes, is the time evolution equation (eq. (30) or (32)).
We reason as follows:






Consider first a special case. Suppose that as a result of a very special
preparation procedure (between times $t_i$ and $t$) the wave function
$\Psi(x, t)$ at time $t$ vanishes everywhere except at a single point $x_0$,

$$\Psi(x, t) = A\,\delta_{x, x_0}. \tag{34}$$



Next, place a filter at time $t$ with a single hole at $x_0$. It is easy to see
(from eq. (30) or (32)) that the presence or absence of this filter has
absolutely no influence on the subsequent evolution of the wave function or on
the amplitude to arrive at any final destination point $x_f$. Since relations
among amplitudes are meant to reflect corresponding relations among setups, it
seems natural to assume that the presence or absence of the filter will have no
effect on whether detection at $x_f$ occurs or not. This, in turn, suggests that
even in the absence of the filter, at time $t$ the particle must have been at
$x_0$ and nowhere else. Therefore the special case where
$\Psi(x, t) \propto \delta_{x, x_0}$ will be interpreted as "at time $t$ the
particle is located at $x_0$."

Essentially the same argument can be applied in a variety of other cases. For
example, if as a result of the preparation procedure $\Psi(x, t)$ vanishes at a
certain point $x'$, then placing a filter with holes everywhere except at $x'$
should have no effect on the subsequent evolution of $\Psi$. Therefore
$\Psi(x', t) = 0$ is interpreted as "at time $t$ the particle is not at $x'$."

This leads naturally to the following general interpretative postulate:
Consider a filter the action of which is to block out those components of the
wave function characterized by a certain feature (what that feature is should
be obvious to whoever built the filter). Suppose that at time $t$ the system is
in a state of wave function $\Psi(t)$. If the introduction or removal of the
filter at $t$ has no effect on the future evolution of the wave function, then
the rule of interpretation is that the system in state $\Psi(t)$ does not
exhibit the feature in question. The rule applies to amplitudes in general.
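The interpretative rule can be phrased as a simple test, sketched here under the assumption that a wave function on a small lattice is a list of complex amplitudes and that a filter is specified by the set of sites where it has holes. The function names and sample amplitudes are hypothetical.

```python
def apply_filter(Psi, open_sites):
    """Block every component of Psi except those at sites with a hole."""
    return [amp if x in open_sites else 0.0 for x, amp in enumerate(Psi)]

def filter_is_irrelevant(Psi, open_sites, tol=1e-12):
    """True when inserting the filter leaves Psi, and hence its future
    evolution, unchanged."""
    filtered = apply_filter(Psi, open_sites)
    return all(abs(a - b) <= tol for a, b in zip(Psi, filtered))

# Wave function concentrated at the single site x_0 = 2.
localized = [0.0, 0.0, 0.7 + 0.2j, 0.0]

# Generic wave function that vanishes only at site 1.
generic = [0.5, 0.0, 0.5j, 0.5 - 0.5j]

# A single hole at x_0 changes nothing: "the particle is at x_0".
at_x0 = filter_is_irrelevant(localized, {2})

# Holes everywhere except site 1 change nothing: "the particle is not at 1".
not_at_1 = filter_is_irrelevant(generic, {0, 2, 3})

# A single hole at site 2 does change the generic wave function, so the
# rule makes no statement of certainty in that case.
undecided = not filter_is_irrelevant(generic, {2})
```

Note that the rule only ever yields statements of certainty; the probabilistic content has to be derived separately.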

The power of this rule becomes apparent when applied to a particle with a
generic wave function

$$\Psi(x, t) = \sum_i A_i\,\delta_{x, x_i}, \tag{35}$$

where the number and location of the $x_i$s is arbitrary. We want to predict the
outcome of an experiment in which a detector is placed at a certain $x_k$. If
$x_k$ differs from all of the $x_i$s in the sum in eq. (35), the interpretative
rule directly implies that the particle will definitely not be detected.

The interesting problem arises when $x_k$ coincides with one of the $x_i$s. In
this case, as expected, one cannot predict the actual outcome of the experiment.
What is predictable, and quite precisely in fact, is the probability of various
outcomes. At this point we will make an important assumption about the wave
function: we will assume that it can be normalized. For convenience we will
from now on assume that $\Psi$ in eq. (35) has been appropriately normalized,



$$\|\Psi\|^2 = (\Psi, \Psi) = \sum_x |\Psi(x, t)|^2 = \sum_i |A_i|^2 = 1. \tag{36}$$

Next we show that the probability of detection at $x_k$ is $|A_k|^2$. Thus
Born's statistical interpretation is actually a theorem: it follows from a
simpler interpretative rule that refers only to situations of absolute
certainty. This remarkable fact was discovered long ago by Finkelstein, by
Hartle, and by Graham. The proof below is particularly suited to the approach
to quantum theory being developed in this paper.

Consider an ensemble of $N$ identically prepared, independent replicas of our
particle; later we will take $N \to \infty$. In the next section we will show
that the wave function for this $N$-particle system is the product

$$\Psi_N(x_1, \ldots, x_N, t) = \prod_{\alpha=1}^{N} \Psi(x_\alpha, t) = \sum_{i_1 \ldots i_N} A_{i_1} \cdots A_{i_N}\, \delta_{x_1, x_{i_1}} \cdots \delta_{x_N, x_{i_N}}. \tag{37}$$

Suppose that in the $N$-particle configuration space we place a special filter,
denoted by $P^k_n$, the action of which is to block all components of $\Psi_N$
except those where exactly $n$ of the $N$ replicas are at $x_k$. The wave
function right after this filter is

$$P^k_n \Psi_N = \sum_{i_1 \ldots i_N} \delta_{n, n_k}\, A_{i_1} \cdots A_{i_N}\, \delta_{x_1, x_{i_1}} \cdots \delta_{x_N, x_{i_N}}, \tag{38}$$

where

$$n_k = \sum_{\alpha=1}^{N} \delta_{k, i_\alpha}. \tag{39}$$

Actually this filter is too strict: it selects a single sharply defined
fraction $f = n/N$. What we need is a more lenient filter (presumably built by
opening additional "holes" in $P^k_n$) that allows passage of all fractions in
a range from $f - \epsilon$ to $f + \epsilon$. The action of this filter is
described by

$$P^k_{f,\epsilon} \Psi_N = \sum_{i_1 \ldots i_N} \left( \sum_{n=(f-\epsilon)N}^{(f+\epsilon)N} \delta_{n, n_k} \right) A_{i_1} \cdots A_{i_N}\, \delta_{x_1, x_{i_1}} \cdots \delta_{x_N, x_{i_N}}. \tag{40}$$

We are now ready to apply our interpretative rule: If, as $N \to \infty$, the
presence of the filter $P^k_{f,\epsilon}$ is found to have no influence
whatsoever on the future evolution of the wave function $\Psi_N$, we will
interpret $\Psi_N$ as representing a state for which the fraction of replicas
at $x_k$ must lie in the range from $f - \epsilon$ to $f + \epsilon$. To show
that this is actually the case we must show that as $N \to \infty$ the wave
function right after the filter, $P^k_{f,\epsilon}\Psi_N$, becomes more and more
similar to the wave function right before the filter, $\Psi_N$. To this end we
calculate the norm

$$\|P^k_{f,\epsilon}\Psi_N - \Psi_N\|^2 = \sum_{x_1 \ldots x_N} |P^k_{f,\epsilon}\Psi_N - \Psi_N|^2. \tag{41}$$

Since filters act as projectors, $P^2 = P$, we get

$$\|P^k_{f,\epsilon}\Psi_N - \Psi_N\|^2 = 1 - (\Psi_N, P^k_{f,\epsilon}\Psi_N). \tag{42}$$

The calculation of the scalar product is straightforward,

$$(\Psi_N, P^k_{f,\epsilon}\Psi_N) = \sum_{x_1 \ldots x_N} \Psi_N^*\, P^k_{f,\epsilon}\Psi_N = \sum_{n=(f-\epsilon)N}^{(f+\epsilon)N} \left( \sum_{i_1 \ldots i_N} \delta_{n, n_k}\, |A_{i_1}|^2 \cdots |A_{i_N}|^2 \right). \tag{43}$$

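The sum over $i_1 \ldots i_N$ is done as follows: Suppose we satisfy the
Kronecker $\delta_{n, n_k}$ constraint by choosing $n$ of the $N$ indices
$i_1, \ldots, i_N$ and setting them to the value $k$. Since the individual
$\Psi$s are normalized, each sum over the remaining $N - n$ indices gives

$$\sum_{i \neq k} |A_i|^2 = 1 - |A_k|^2. \tag{44}$$

But there are $\binom{N}{n}$ ways to choose which $n$ indices are set equal to
$k$, therefore

$$(\Psi_N, P^k_{f,\epsilon}\Psi_N) = \sum_{n=(f-\epsilon)N}^{(f+\epsilon)N} \binom{N}{n} \left(|A_k|^2\right)^n \left(1 - |A_k|^2\right)^{N-n}. \tag{45}$$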
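The counting argument can be checked by brute force for a small number of replicas. The sketch below assumes three lattice sites with hypothetical weights $|A_i|^2$, enumerates every index tuple, keeps those with exactly $n$ entries equal to $k$, and compares the total with the binomial weight $\binom{N}{n}\,p^n (1-p)^{N-n}$, where $p = |A_k|^2$. The function names are illustrative only.

```python
from itertools import product
from math import comb

def overlap_brute_force(probs, k, n, N):
    """Sum of |A_i1|^2 ... |A_iN|^2 over all index tuples (i1, ..., iN)
    in which exactly n of the N indices equal k."""
    total = 0.0
    for indices in product(range(len(probs)), repeat=N):
        if indices.count(k) == n:      # Kronecker delta constraint
            term = 1.0
            for i in indices:
                term *= probs[i]
            total += term
    return total

def overlap_binomial(p_k, n, N):
    """Closed form: C(N, n) * p_k^n * (1 - p_k)^(N - n)."""
    return comb(N, n) * p_k**n * (1.0 - p_k)**(N - n)

# Hypothetical single-replica weights |A_i|^2, normalized to sum to 1.
probs = [0.2, 0.5, 0.3]
k, N = 1, 5

pairs = [(overlap_brute_force(probs, k, n, N), overlap_binomial(probs[k], n, N))
         for n in range(N + 1)]
```

The enumeration grows as the number of sites to the power $N$, so it is only practical for very small $N$; the closed form is what makes the large-$N$ limit tractable.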




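For large $N$ this binomial sum tends to the integral of a Gaussian,

$$(\Psi_N, P^k_{f,\epsilon}\Psi_N) = \int_{f-\epsilon}^{f+\epsilon} \frac{1}{\sqrt{2\pi\sigma_N^2}} \exp\left( -\frac{(f' - \bar{f})^2}{2\sigma_N^2} \right) df', \tag{46}$$

with mean $\bar{f} = |A_k|^2$ and variance $\sigma_N^2 = \bar{f}(1 - \bar{f})/N$.
In the limit $N \to \infty$ this is more concisely written as a $\delta$
function, therefore

$$\lim_{N \to \infty} \|P^k_{f,\epsilon}\Psi_N - \Psi_N\|^2 = 1 - \int_{f-\epsilon}^{f+\epsilon} \delta(f' - |A_k|^2)\, df'. \tag{47}$$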

The interpretation is clear: as $N \to \infty$ the filter will have no effect
on the wave function $\Psi_N$ provided $f$ lies in a range $2\epsilon$ about
$|A_k|^2$, and according to our interpretative rule $\Psi_N$ cannot contain any
fractions outside this range. Choosing stricter filters with $\epsilon \to 0$
we conclude that as $N \to \infty$, $\Psi_N$ is a state for which the fraction
of replicas at $x_k$ is exactly $|A_k|^2$.
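The concentration of the binomial sum can be seen numerically. The sketch below evaluates the total binomial weight of the fractions $n/N$ lying between $f - \epsilon$ and $f + \epsilon$, working with logarithms (via `math.lgamma`) so that large $N$ does not overflow. The function names and the sample values of $p = |A_k|^2$, $f$ and $\epsilon$ are assumptions chosen for illustration, and the code requires $0 < p < 1$.

```python
from math import ceil, exp, floor, lgamma, log

def log_binomial_term(N, n, p):
    """log of C(N, n) * p^n * (1 - p)^(N - n), valid for 0 < p < 1."""
    return (lgamma(N + 1) - lgamma(n + 1) - lgamma(N - n + 1)
            + n * log(p) + (N - n) * log(1.0 - p))

def filter_overlap(N, p, f, eps):
    """Binomial weight of all n with f - eps <= n/N <= f + eps.
    This plays the role of the scalar product (Psi_N, P Psi_N)."""
    n_min = max(0, ceil((f - eps) * N))
    n_max = min(N, floor((f + eps) * N))
    return sum(exp(log_binomial_term(N, n, p)) for n in range(n_min, n_max + 1))

p = 0.3      # hypothetical |A_k|^2
eps = 0.05

# Filter centred on f = |A_k|^2: the overlap approaches 1 as N grows.
centred = [filter_overlap(N, p, 0.3, eps) for N in (100, 1000, 4000)]

# Filter centred elsewhere: the overlap approaches 0 as N grows.
off_centre = [filter_overlap(N, p, 0.5, eps) for N in (100, 1000, 4000)]
```

In this sketch a filter whose window contains $|A_k|^2$ becomes invisible to the wave function for large $N$, while any window that excludes $|A_k|^2$ blocks essentially everything.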

Returning to the original single particle system, we see that we cannot
predict whether a detection at $x_k$ will occur or not. In fact, we have a very
definite prediction of indeterminism: for large $N$ detection will certainly
occur for a fraction $|A_k|^2$ and, equally significant, detection will
certainly not occur for a fraction $1 - |A_k|^2$. Once the assumption is made
that the relations among possible experimental setups are quantitatively
represented by consistently assigned amplitudes, the general interpretative
rule implies indeterminism. The best one can do is assign a probability $p$ to
this detection. Given the information that for a large number of identically
prepared systems the fraction of successful detections is $|A_k|^2$, the only
assignment consistent with the law of large numbers is the value

$$p = |A_k|^2. \tag{48}$$

To complete our proof of Born's postulate we must prove that the wave function 
for N independent particles is the product of the wave functions for each one of 
the particles. This is the topic of the next section. 

6 Several independent particles 

We want to show that the wave function of a system $\alpha\beta$ composed of two
independent particles $\alpha$ and $\beta$ is the product of the wave functions
for each particle, $\Psi_{\alpha\beta} = \Psi_\alpha \Psi_\beta$. In the spirit
of the previous sections the first step must be that of defining the statements
about composite systems in terms of the experimental setups designed to test
them and of providing a representation in terms of amplitudes of the relations
among those setups.

Our system is composed of two independent particles. The notion of
independence imposes highly non-trivial constraints. Suppose that the allowed
propositions about particle $\alpha$ by itself and the corresponding setups
designed to test them are denoted by $a$, as in eq. (1), and similarly that the
allowed propositions and setups about particle $\beta$ are denoted by $b$. Then
the first condition implied by independence is that the statements $c$ about
the composite $\alpha\beta$ are restricted to those that can be tested by
composite setups that separately test $a$ about $\alpha$ and $b$ about $\beta$.
Thus the setups allowed for $\alpha\beta$ are of the form $c = \{a; b\}$.

The various ways in which the composite setups $c$ can be combined can be
derived from the various ways in which the $a$s and the $b$s can be combined
among themselves. Thus, if $c_1 = \{a_1; b_1\}$ and $c_2 = \{a_2; b_2\}$, and if
$a_1 \vee a_2$ is allowed and $b_1 = b_2$, then we define "or" by
$c_1 \vee c_2 = \{a_1 \vee a_2; b_1\}$. On the other hand, if it is
$b_1 \vee b_2$ that is allowed and $a_1 = a_2$, then
$c_1 \vee c_2 = \{a_1; b_1 \vee b_2\}$. In the general case $c_1 \vee c_2$ will
be an allowed setup only if $c_1$ and $c_2$ differ in one and only one filter;
if the $b$ setups differ then the $a$s must be identical and vice versa.
Similarly, if both $a_1 a_2$ and $b_1 b_2$ are allowed then we define
$c_1 c_2 = \{a_1 a_2; b_1 b_2\}$. Commutativity, associativity and
distributivity for the "and" and "or" relations among the $c$ setups follow
from the corresponding properties for the $a$ and $b$ setups. The argument of
section 3 can now be repeated: the relations among different composite setups
can be conveniently represented quantitatively by assigning a complex amplitude
$\psi(c)$ to each setup $c$ in such a way that the sum and product rules hold.

The second crucial condition implicit in the notion of independence, one that
goes beyond the mere capability of independently placing filters in the paths
of $\alpha$ and $\beta$, is the requirement that changing the filters acting on
$\alpha$, i.e., changing $a$ to $a'$, shall have no influence whatsoever on the
outcome of $b$, and vice versa. Since relations among amplitudes are meant to
reflect corresponding relations among setups, and the physically relevant
information about setups $a$ and $b$ is contained in $\psi(a)$ and $\psi(b)$,
this second condition can be quantitatively expressed by the requirement that
the physically relevant information about setup $c$, expressed by $\psi(c)$, be
some function of $\psi(a)$ and $\psi(b)$ and nothing else. Thus,

$$\psi(c) = F(\psi(a), \psi(b)). \tag{49}$$

This is what we mean by independence.

The function $F$ is determined from the fact that not only $\psi(a)$ and
$\psi(b)$ but also $\psi(c)$ must satisfy the sum and product rules. For
example, if $c_1 = \{a_1; b_1\}$ and $c_2 = \{a_2; b_2\}$, and if $a_1 \vee a_2$
is allowed and $b_1 = b_2$, then $\psi(c_1 \vee c_2) = \psi(c_1) + \psi(c_2)$
implies

$$F(\psi(a_1) + \psi(a_2), \psi(b_1)) = F(\psi(a_1), \psi(b_1)) + F(\psi(a_2), \psi(b_1)), \tag{50}$$

so that

$$F(u + v, w) = F(u, w) + F(v, w). \tag{51}$$

Similarly, if $b_1 \vee b_2$ is allowed and $a_1 = a_2$ then

$$F(u, v + w) = F(u, v) + F(u, w). \tag{52}$$

Finally, if $c_1 c_2 = \{a_1 a_2; b_1 b_2\}$ is allowed, from the product rule
$\psi(c_1 c_2) = \psi(c_1)\,\psi(c_2)$ we get

$$F(uv, wz) = F(u, w)\,F(v, z). \tag{53}$$

Equations (51) and (52) are formally identical with equations (24) and (25).
Therefore $F(u, v) = Cuv$. Substituting into (53) we get $C = 1$, so that
$F(u, v) = uv$.

Therefore, if particles $\alpha$ and $\beta$ are independent, the amplitude
associated to $c = \{a; b\}$ is the product of the amplitudes associated to $a$
and to $b$,

$$\psi(c) = \psi(a)\,\psi(b). \tag{54}$$

The generalization to $N$ independent particles is straightforward.
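A quick numerical sketch shows how the three functional equations pin down the constant. It assumes the bilinear form $F(u, v) = Cuv$ and tests it on arbitrary complex sample values; the helper names are hypothetical. Additivity in each argument holds for any $C$, while the product rule $F(uv, wz) = F(u, w)\,F(v, z)$ holds only for $C = 1$ (apart from the trivial $C = 0$).

```python
def F(u, v, C=1.0):
    """Candidate composition function F(u, v) = C * u * v."""
    return C * u * v

def satisfies_sum_rules(C, u, v, w, tol=1e-12):
    """Additivity in the first and in the second argument."""
    first = abs(F(u + v, w, C) - (F(u, w, C) + F(v, w, C))) < tol
    second = abs(F(u, v + w, C) - (F(u, v, C) + F(u, w, C))) < tol
    return first and second

def satisfies_product_rule(C, u, v, w, z, tol=1e-12):
    """Product rule: F(u*v, w*z) = F(u, w) * F(v, z)."""
    return abs(F(u * v, w * z, C) - F(u, w, C) * F(v, z, C)) < tol

# Arbitrary complex test amplitudes (illustrative values only).
u, v, w, z = 0.3 + 0.4j, -0.2 + 0.1j, 0.5j, 0.7 - 0.6j

sum_ok_any_C = all(satisfies_sum_rules(C, u, v, w) for C in (0.5, 1.0, 2.0))
product_ok_C1 = satisfies_product_rule(1.0, u, v, w, z)
product_fails_C2 = not satisfies_product_rule(2.0, u, v, w, z)
```

This is only a spot check at a few sample points, not a proof; the proof is the functional-equation argument in the text.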

7 Conclusions and some comments 

Let us summarize our main conclusions: After having identified a restricted set 
of allowed propositions in terms of the experimental setups designed to test them 
we introduce amplitudes as the essentially unique tool to carry out consistent 
calculations. The rules of manipulation are necessarily such that time evolution
is described by a linear Schrödinger equation and that the Born probability
interpretation holds.

It might seem surprising that a substantial amount of quantum mechanics has
been reproduced without any mention of commutation relations among incompatible
observables and of the corresponding uncertainty relations; in fact, we have
only discussed the measurement of position. However, observables other than
position are useful concepts in that they facilitate the description of
experiments and the manipulation of information. Although they have not been
central in this formulation of quantum theory it may be worthwhile to remark on
how they arise. In general, they originate from the idea that by placing various
filters, diffraction gratings, magnetic fields, etc., prior to the final
position detection at $x_f$ one can effectively build a more complex detector.
This raises two questions. The first is what such a complex device actually
measures. The answer is that what is measured is the extent to which the actual
wave function $\Psi$ resembles another wave function $\Phi$ which is a
characteristic of the detector. Should the resemblance be complete, the particle
would be detected at $x_f$ with absolute certainty. The second question is what
properties we are actually interested in measuring. Typically, interesting
measurements will result in information useful for prediction in other
experiments, and foremost among these is the measurement of properties that
have some lasting value, i.e., conserved quantities. A related question is that
of deciding how the Hamiltonian should be chosen; this choice, like that of most
other observables, is dictated by symmetries and will not be further pursued
here.

For clarity we have focused our attention on the special case of a single par- 
ticle moving in a discrete lattice. But it should be clear that the argument can 
be generalized to considerably more complicated configuration spaces. The cru- 
cial feature is to identify the relevant propositions or equivalently, the idealized 
experimental setups designed to test them, and verify that the appropriate rules 
of associativity and distributivity hold. It is of some interest that in order to 
implement the associativity constraint one requires a configuration space which 
consists at least of three values. Remarkably, a similar restriction to spaces of 
three dimensions or more appears also in the work of Gleason. This does not,
of course, represent a problem: two-valued configuration spaces are unphysical.
For example, realistic spin-1/2 systems also have translational and a variety of
other degrees of freedom.

It is possible that the use of more complex detectors (i.e., other
observables) will permit extending the set of allowed propositions. Perhaps this
would bridge the gap between the present formulation and the quantum logic
approach. However, whether such an extension is advantageous is not at all
clear. It would surely spoil the distributivity property (of "and" and "or")
which has played such a crucial role here.

Many are the questions left open. Most, like the application of quantum 
mechanics to the detectors themselves, as well as to other macroscopic objects, 
the nature of the classical limit, and other issues associated with decoherence 
through interaction with the environment, are common to all approaches to 
quantum theory. But some questions seem more urgent in this formulation. A
particularly glaring one is: why complex numbers? Perhaps other mathematical
objects with the appropriate associative and distributive algebras (e.g.,
quaternions, multivectors, etc.) could be used to obtain representations of the
"and" and "or" operations.

We conclude with a couple of brief comments. The first concerns the similarity
between quantum motion and Brownian motion, or between the Schrödinger equation
and the diffusion equation. There is a natural reluctance to dismiss it as a
mere coincidence, and it has been suggested that perhaps there is some
underlying stochastic physical agent responsible for the peculiar features of
quantum motion. Our results suggest that such a physical agent need not exist,
and that the similarities between quantum and Brownian theories arise from
formalisms that are strongly constrained by similar logical requirements of
consistency which force one to manipulate amplitudes in one case, and
probabilities in the other, using similar sum and product rules.

The second comment addresses another aspect of the robustness of quan- 
tum theory, its universality. Quantum mechanics applies to a wide variety of 
systems over a broad range of energy and distance scales. Were new exotic 
objects (say, new excitations in condensed matter, or new particles, or strings) 
to be discovered, could we expect them to be described by quantum mechanics? 
Classical mechanics fails at atomic scales; how short a distance can we go and 
still expect quantum mechanics to hold? Or, in other words: What are the 
accepted features of today's physics that could reasonably be expected to hold 
in the future physics of objects that are yet to be discovered, or of energy and 
distance scales that are yet to be explored? It seems natural to assume that 
any list of such features should prominently include those derivable from mere 
consistency requirements; linearity and indeterminism are likely to be among 
them. 

A Solution of the consistency equations 

Our approach to solving the associativity equation (16),

$$S(S(u, v), w) = S(u, S(v, w)), \tag{55}$$



is essentially that due to Cox. A minor difference is that we deal with complex
rather than real variables. Let $r = S(u, v)$, $s = S(v, w)$,
$S_1(u, v) = \partial S(u, v)/\partial u$, and
$S_2(u, v) = \partial S(u, v)/\partial v$. Then eq. (55) and its derivatives
with respect to $u$ and $v$ are

$$S(r, w) = S(u, s), \tag{56}$$

$$S_1(r, w)\,S_1(u, v) = S_1(u, s), \tag{57}$$

and

$$S_1(r, w)\,S_2(u, v) = S_2(u, s)\,S_1(v, w). \tag{58}$$

Eliminating $S_1(r, w)$ from these last two equations we get

$$G(u, v) = G(u, s)\,S_1(v, w), \tag{59}$$

where

$$G(u, v) = \frac{S_2(u, v)}{S_1(u, v)}. \tag{60}$$

Multiplying eq. (59) by $G(v, w)$ we get

$$G(u, v)\,G(v, w) = G(u, s)\,S_2(v, w). \tag{61}$$

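Differentiating the right hand side of eq. (61) with respect to $v$ and
comparing with the derivative of eq. (59) with respect to $w$, we have

$$\frac{\partial}{\partial v}\bigl(G(u, s)\,S_2(v, w)\bigr) = \frac{\partial}{\partial w}\bigl(G(u, s)\,S_1(v, w)\bigr) = \frac{\partial}{\partial w}\bigl(G(u, v)\bigr) = 0. \tag{62}$$

Therefore

$$\frac{\partial}{\partial v}\bigl(G(u, v)\,G(v, w)\bigr) = 0, \tag{63}$$

or,

$$\frac{1}{G(u, v)} \frac{\partial G(u, v)}{\partial v} = -\frac{1}{G(v, w)} \frac{\partial G(v, w)}{\partial v} \equiv h(v). \tag{64}$$

Since the left side is independent of $w$ and the right side is independent of
$u$, both sides equal a function $h(v)$ of $v$ alone.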

Integrating, we get

$$G(u, v) = G(u, 0) \exp\left( \int_0^v h(v')\,dv' \right), \tag{65}$$

and also

$$G(v, w) = G(0, w) \exp\left( -\int_0^v h(v')\,dv' \right), \tag{66}$$



so that

$$G(u, v) = c\,\frac{H(u)}{H(v)}, \tag{67}$$

where $c = G(0, 0)$ is a constant and

$$H(u) = \exp\left( -\int_0^u h(u')\,du' \right). \tag{68}$$
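The factorized form $G(u, v) = c\,H(u)/H(v)$ can be checked numerically on a concrete associative function. The sketch below makes several simplifying assumptions that are not in the paper: it uses real rather than complex variables, it picks the hypothetical function $\xi(u) = u + u^3/3$ and builds $S(u, v) = \xi^{-1}(\xi(u) + \xi(v))$ from it, and it estimates the partial derivatives by central finite differences. For this choice $H(u) = 1/(1 + u^2)$ and $c = 1$.

```python
def xi(u):
    """Hypothetical choice: xi(u) = u + u^3/3, so xi'(u) = 1 + u^2."""
    return u + u**3 / 3.0

def xi_inverse(y):
    """Invert the monotonic function xi by bisection.
    Since |xi(u)| >= |u|, the root lies in [-|y| - 1, |y| + 1]."""
    lo, hi = -abs(y) - 1.0, abs(y) + 1.0
    for _ in range(200):
        mid = 0.5 * (lo + hi)
        if xi(mid) < y:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def S(u, v):
    """An associative composition built from xi."""
    return xi_inverse(xi(u) + xi(v))

def G(u, v, h=1e-5):
    """G = S_2 / S_1, with central finite differences for the derivatives."""
    S1 = (S(u + h, v) - S(u - h, v)) / (2 * h)
    S2 = (S(u, v + h) - S(u, v - h)) / (2 * h)
    return S2 / S1

def H(u):
    """For this xi, H(u) = 1 / xi'(u)."""
    return 1.0 / (1.0 + u**2)

u, v, w = 0.3, 0.7, 1.1
associativity_gap = abs(S(S(u, v), w) - S(u, S(v, w)))
factorization_gap = abs(G(u, v) - H(u) / H(v))
```

Both gaps come out at the level of numerical error, which is consistent with the factorization derived above for this particular example.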

On substituting back into eqs. (59) and (61) we get

$$S_1(v, w) = \frac{H(s)}{H(v)} \quad \text{and} \quad S_2(v, w) = c\,\frac{H(s)}{H(w)}. \tag{69}$$

But $s = S(v, w)$, so substituting (69) into
$ds = S_1(v, w)\,dv + S_2(v, w)\,dw$ we get

$$\frac{ds}{H(s)} = \frac{dv}{H(v)} + c\,\frac{dw}{H(w)}. \tag{70}$$
This is easily integrated. Let

$$\xi(u) = \xi(0) + \int_0^u \frac{du'}{H(u')}, \tag{71}$$

so that $du/H(u) = d\xi(u)$. Then

$$\xi(S(v, w)) = \xi(v) + c\,\xi(w), \tag{72}$$

where a constant of integration has been absorbed into $\xi(0)$. Substituting
this function $\xi$ back into eq. (55) we obtain $c = 1$. This completes our
derivation of the solution of the associativity equation.